智能论文笔记

RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR

Yuyin Zhou , Shih-Cheng Huang , Jason Alan Fries , Alaa Youssef , Timothy J. Amrhein , Marcello Chang , Imon Banerjee , Daniel Rubin , Lei Xing , Nigam Shah

分类：计算机视觉

2021-11-23

尽管辐射学家常规使用电子健康记录（EHR）数据来形成临床历史并通知图像解释，但医学成像的大多数深度学习架构是单向的，即，它们只能从像素级信息中学习特征。最近的研究揭示了如何从像素数据中恢复种族，仅突出显示模型中的严重偏差的可能性，这未能考虑人口统计数据和其他关键患者属性。然而，缺乏捕获临床背景的成像数据集，包括人口统计学和纵向病史，具有偏远的多式化医学成像。为了更好地评估这些挑战，我们呈现RadFusion，一种多式联运，基准数据集1794名患者的相应EHR数据和高分辨率计算断层扫描（CT）扫描标记为肺栓塞。我们评估了几个代表性的多模式融合模型，并在受保护的亚组中，例如性别，种族/种族，年龄的年龄。我们的研究结果表明，集成成像和EHR数据可以提高分类性能和鲁棒性，而不会在人口群之间的真正阳性率下引入大的差异。

translated by 谷歌翻译

A Memetic Algorithm with Reinforcement Learning for Sociotechnical Production Scheduling

Felix Grumbach , Nour Eldin Alaa Badr , Pascal Reusch , Sebastian Trojahn

分类：机器学习 | 人工智能

2022-12-21

The following article presents a memetic algorithm with applying deep reinforcement learning (DRL) for solving practically oriented dual resource constrained flexible job shop scheduling problems (DRC-FJSSP). In recent years, there has been extensive research on DRL techniques, but without considering realistic, flexible and human-centered shopfloors. A research gap can be identified in the context of make-to-order oriented discontinuous manufacturing as it is often represented in medium-size companies with high service levels. From practical industry projects in this domain, we recognize requirements to depict flexible machines, human workers and capabilities, setup and processing operations, material arrival times, complex job paths with parallel tasks for bill of material (BOM) manufacturing, sequence-depended setup times and (partially) automated tasks. On the other hand, intensive research has been done on metaheuristics in the context of DRC-FJSSP. However, there is a lack of suitable and generic scheduling methods that can be holistically applied in sociotechnical production and assembly processes. In this paper, we first formulate an extended DRC-FJSSP induced by the practical requirements mentioned. Then we present our proposed hybrid framework with parallel computing for multicriteria optimization. Through numerical experiments with real-world data, we confirm that the framework generates feasible schedules efficiently and reliably. Utilizing DRL instead of random operations leads to better results and outperforms traditional approaches.

translated by 谷歌翻译

Effective Dynamics of Generative Adversarial Networks

Steven Durr , Youssef Mroueh , Yuhai Tu , Shenshen Wang

分类：机器学习 | (统计)机器学习

2022-12-08

Generative adversarial networks (GANs) are a class of machine-learning models that use adversarial training to generate new samples with the same (potentially very complex) statistics as the training samples. One major form of training failure, known as mode collapse, involves the generator failing to reproduce the full diversity of modes in the target probability distribution. Here, we present an effective model of GAN training, which captures the learning dynamics by replacing the generator neural network with a collection of particles in the output space; particles are coupled by a universal kernel valid for certain wide neural networks and high-dimensional inputs. The generality of our simplified model allows us to study the conditions under which mode collapse occurs. Indeed, experiments which vary the effective kernel of the generator reveal a mode collapse transition, the shape of which can be related to the type of discriminator through the frequency principle. Further, we find that gradient regularizers of intermediate strengths can optimally yield convergence through critical damping of the generator dynamics. Our effective GAN model thus provides an interpretable physical framework for understanding and improving adversarial training.

translated by 谷歌翻译

Adaptive Batch Normalization for Training Data with Heterogeneous Features

Wael Alsobhi , Tarik Alafif , Alaa Abdel-Hakim , Weiwei Zong

分类：机器学习

2022-11-03

Batch Normalization (BN) is an important preprocessing step to many deep learning applications. Since it is a data-dependent process, for some homogeneous datasets it is a redundant or even a performance-degrading process. In this paper, we propose an early-stage feasibility assessment method for estimating the benefits of applying BN on the given data batches. The proposed method uses a novel threshold-based approach to classify the training data batches into two sets according to their need for normalization. The need for normalization is decided based on the feature heterogeneity of the considered batch. The proposed approach is a pre-training processing, which implies no training overhead. The evaluation results show that the proposed approach achieves better performance mostly in small batch sizes than the traditional BN using MNIST, Fashion-MNIST, CIFAR-10, and CIFAR-100 datasets. Additionally, the network stability is increased by reducing the occurrence of internal variable transformation.

translated by 谷歌翻译

Speech Forensics: Blind Voice Mimicry Detection

Sahar Al Ajmi , Khizar Hayat , Alaa M. Al Obaidi , Naresh Kumar , Munaf Najmuldeen , Baptiste Magnier

分类：人工智能 | 机器学习 | 神经与进化计算

2022-09-26

音频是人类交流最常用的方式之一，但与此同时，它很容易被欺骗人们滥用。随着AI的革命，几乎每个人都可以访问相关技术，从而使罪犯犯罪和伪造变得简单。在这项工作中，我们引入了一种深度学习方法，以开发一种分类器，该分类器将盲目地将输入音频分类为真实或模仿。提出的模型接受了从大型音频数据集提取的一组重要功能的培训，以获取分类器，该分类器已在不同音频的相同功能上进行了测试。为这项工作创建了两个数据集；所有英语数据集和混合数据集（阿拉伯语和英语）。这些数据集已通过GitHub提供，可在https://github.com/sass7/dataset上使用研究社区。为了进行比较，还通过人类检查对音频进行了分类，主题是母语人士。随之而来的结果很有趣，并且表现出强大的精度。

translated by 谷歌翻译

Deep Learning on Home Drone: Searching for the Optimal Architecture

Alaa Maalouf , Yotam Gurfinkel , Barak Diker , Oren Gal , Daniela Rus , Dan Feldman

分类：计算机视觉 | 机器学习 | 机器人

2022-09-21

我们建议第一个通过对弱的微型计算机进行深入学习的实时语义细分的系统，例如Raspberry Pi Zero Zero V2（其价格\ 15美元）附加到玩具无人机上。特别是，由于Raspberry Pi的重量不到$ 16 $，并且其大小是信用卡的一半，因此我们可以轻松地将其连接到普通的商业DJI Tello玩具器中（<\ $ 100，<90克，98 $ \ \时间$ 92.5 $ \ times $ 41毫米）。结果是可以从板载单眼RGB摄像头（无GPS或LIDAR传感器）实时检测和分类对象的自动无人机（无笔记本电脑或人类）。伴侣视频展示了这款Tello无人机如何扫描实验室的人（例如使用消防员或安全部队）以及在实验室外的空停车位。现有的深度学习解决方案要么在这种物联网设备上实时计算要么太慢，要么提供不切实际的质量结果。我们的主要挑战是设计一个系统，该系统在网络，深度学习平台/框架，压缩技术和压缩比的众多组合中占有最好的选择。为此，我们提供了一种有效的搜索算法，旨在找到最佳组合，从而导致网络运行时间与其准确性/性能之间的最佳权衡。

translated by 谷歌翻译

Pruning Neural Networks via Coresets and Convex Geometry: Towards No Assumptions

Murad Tukan , Loay Mualem , Alaa Maalouf

分类：机器学习 | 人工智能

2022-09-18

修剪是压缩深神经网络（DNNS）的主要方法之一。最近，将核（可证明的数据汇总）用于修剪DNN，并增加了理论保证在压缩率和近似误差之间的权衡方面的优势。但是，该域中的核心是数据依赖性的，要么是在模型的权重和输入的限制性假设下生成的。在实际情况下，这种假设很少得到满足，从而限制了核心的适用性。为此，我们建议一个新颖而健壮的框架，用于计算模型权重的轻度假设，而没有对训练数据的任何假设。这个想法是计算每个层中每个神经元相对于以下层的输出的重要性。这是通过l \“ {o} wner椭圆形和caratheodory定理的组合来实现的。我们的方法同时依赖数据独立，适用于各种网络和数据集（由于简化的假设），以及在理论上支持的。方法的表现优于基于核心的现有神经修剪方法在广泛的网络和数据集上。例如，我们的方法在Imagenet上获得了$ 62 \％$的压缩率，ImageNet上的RESNET50的准确性下降了$ 1.09 \％$。

translated by 谷歌翻译

CGAN-ECT: Tomography Image Reconstruction from Electrical Capacitance Measurements Using CGANs

Wael Deabes , Alaa E. Abdel-Hakim

分类：人工智能 | 计算机视觉 | 机器学习

2022-09-07

由于电容层析成像（ECT）应用在几个工业领域的快速增长，因此从原始电容测量中开发出高质量但快速的图像重建方法的需求。深度学习是一种有效的非线性映射工具，用于复杂功能，在包括电断层扫描在内的许多领域都流行了。在本文中，我们提出了一个条件生成对抗网络（CGAN）模型，用于重建电容测量的ECT图像。 CGAN模型的初始图像是根据电容测量构建的。据我们所知，这是第一次以图像形式表示电容测量。我们创建了一个新的大规模ECT数据集，该数据集的320K合成图像测量对进行训练和测试所提出的模型。使用测试数据集，受污染的数据和流动模式评估所提出的CGAN-ECT模型的可行性和概括能力，这些数据集在训练阶段未暴露于模型。评估结果证明，与传统和其他基于学习的图像重建算法相比，提出的CGAN-ECT模型可以有效地创建更准确的ECT图像。 CGAN-ECT达到的平均图像相关系数超过99.3％，平均相对图像误差约为0.07。

translated by 谷歌翻译

Cloud-Based Real-Time Molecular Screening Platform with MolFormer

Brian Belgodere , Vijil Chenthamarakshan , Payel Das , Pierre Dognin , Toby Kurien , Igor Melnyk , Youssef Mroueh , Inkit Padhi , Mattia Rigotti , Jarret Ross

分类：机器学习

2022-08-13

随着自动化许多具有高保真性的化学任务的前景，化学语言处理模型正在快速迅速出现。在这里，我们提出了一个基于云的实时平台，该平台允许用户实际上筛选感兴趣的分子。为此，将杠杆化从最近提出的大型化学语言模型（名为Moleformer）推断出来的分子嵌入。该平台目前支持三个任务：最近的邻居检索，化学空间可视化和财产预测。根据该平台的功能并获得的结果，我们认为这样的平台可以在自动化化学和化学工程研究中起关键作用，并协助药物发现和材料设计任务。在\ url {www.ibm.biz/molecular_demo}提供我们平台的演示。

translated by 谷歌翻译

Edge-Based Self-Supervision for Semi-Supervised Few-Shot Microscopy Image Cell Segmentation

Youssef Dawoud , Katharina Ernst , Gustavo Carneiro , Vasileios Belagiannis

分类：计算机视觉 | 机器学习

2022-08-03

深层神经网络目前为显微镜图像细胞分割提供了令人鼓舞的结果，但是它们需要大规模标记的数据库，这是一个昂贵且耗时的过程。在这项工作中，我们通过将自我监督与半监督的学习相结合来放松标签要求。我们提出了基于边缘的地图的预测，以自我监督未标记的图像的训练，该图像与少数标记的图像的监督培训相结合，用于学习分割任务。在我们的实验中，我们在几次显微镜图像细胞分割基准上进行了评估，并表明只有少数注释的图像，例如原始训练集的10％足以让我们的方法与1到10次的完全注释的数据库达到类似的性能。我们的代码和训练有素的模型公开可用

translated by 谷歌翻译